Sense amplifier architecture for sliding banks for a simultaneous operation flash memory device

ABSTRACT

A flash memory device is provided with a two stage sense amplifier. The two stage sense amplifier includes a sense pre-amplifier coupled to a sense output amplifier. The sense pre-amplifier amplifies a data signal from a memory bank. The sense output amplifier then differentially compares the output of the sense pre-amplifier with a reference signal. An embodiment with first and second memory banks is also provided that is capable of simultaneous operation.

COPYRIGHT NOTICE

A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever.

BACKGROUND

Computers, personal digital assistants, cellular telephones and other electronic systems and devices typically include processors and memory. The memory is used to store instructions (typically in the form of computer programs) to be executed and/or data to be operated on by the processors to achieve the functionality of the device. In some applications, the systems and devices may require that the instructions and/or data be retained in some form of a permanent/non-volatile storage medium so that the information is not lost when the device is turned off or power is removed. Exemplary applications include computer BIOS storage and diskless handheld computing devices such as personal digital assistants.

One way to provide such non-volatile storage capability is to include a mass-storage device such as a hard disk drive. Hard disk drives are mechanical devices which store data on rotating magnetic platters. However, such devices may be difficult to fit in small systems and may have significant reliability, cost and manufacturing constraints. An alternative to such devices are integrated-circuit based non-volatile memories. One type of non-volatile memory that can be used is Erasable Programmable Read Only Memory (“EPROM”). While conventional EPROM's provide reliable non-volatile storage, they may not be able to be reprogrammed in the field in a practical manner. For example, EPROM's typically require exposure to ultraviolet light to erase them which may require that the EPROM memory chips be removed from the device. Once erased and reprogrammed, they are placed back in the device. In many applications, removing the memory to reprogram the device is not practical. In addition, besides not being easily reprogrammed, EPROM's may not have satisfactory data storage densities.

To avoid the complexity of EPROM's and to provide a device that can be reprogrammed in the field, many electronic designs use Electrically Erasable Programmable Read Only Memory (“EEPROM”), Static Random Access Memory (“SRAM”) or flash memory, which can be reprogrammed electrically and without special hardware. SRAM is not technically a form of non-volatile memory but can be used in some applications requiring non-volatile capability.

EEPROM has the disadvantages of being expensive and having a very limited life cycle, i.e. an EEPROM can only be erased and rewritten a limited number of times before the device becomes non-functional. SRAM offers high operating speeds but only maintains its contents as long as power is supplied, therefore requiring a battery or other power source. This necessitates additional hardware to maintain power to the SRAM to preserve the stored contents which increases manufacturing cost and complexity. Further, the additional hardware may put undesirable constraints on the physical size of the design. In addition, EEPROM's and SRAM's may not have as high a data storage density as compared to other forms of storage. Therefore, where cost, size or density is a factor, flash memories are preferred because they may be simpler to reprogram in the field then EPROM's, less expensive than EEPROM's, easier to implement than battery-backed SRAM's and available in higher data storage densities.

Flash memory (or flash RAM) is a form of non-volatile storage which uses a memory cell design with a floating gate. High voltages are applied to the memory cell inputs to program/store charge on the floating gate or to erase/remove charge from the floating gate. Programming occurs by hot electron transfer to place charge on the floating gate while erasure makes use of Fowler-Nordheim tunneling in which electrons pierce through a thin dielectric material, reducing the amount of electronic charge on the floating gate. Erasing a cell sets the logical value of the cell to “1” while programming the cell sets the logical value to “0”. Aside from programming or erasing operations, a flash memory operates similarly to a randomly accessible read only memory (ROM). Conventionally, a flash memory chip, including the flash memory storage cells and support logic/circuitry, is made by fabricating layers of semiconductor material and interconnect layers of polysilicon and first and second metal layers onto a substrate. It will be appreciated that there are numerous integrated circuit fabrication techniques, involving more or fewer layers, which are applicable herein.

Prior flash memories could only be erased by erasing the entire memory chip also known as bulk erasure. Byte by byte erasure was not possible. To somewhat alleviate this problem, modern flash memory is typically divided logically into blocks called “sectors” where each sector contains a portion of the total bytes of data storage available. For example, a typical flash memory may have 32 megabits of total storage and be logically broken down into 64 sectors, each sector containing 64 Kilobytes of data (one byte being equal to eight bits). This arrangement allows for the option of erasure of one sector at a time in addition to bulk erasure of the entire memory. While typical flash memories are still incapable of byte by byte erasure, data in the flash memory may still be programmed byte by byte (or sometimes word by word, where a word equals four bytes) depending on the implementation. It will be appreciated that the granularity by which a flash memory device can be programmed or erased may vary and that granularities down to bit level programming/erasure are contemplated.

In order to program and/or erase a flash memory, typically a complex process must be followed. For example, before erasing a particular sector, that sector must be programmed (known as “pre-programming”). These steps of erasing and programming involve complex application of high voltages to the memory cells for specified periods of time and in particular sequences. Many flash memories provide embedded state machines which perform the complex programming and erasing operations automatically. These processes of programming and erasing a flash memory may take a long time to complete. A typical erase sequence can take anywhere from 0.7 seconds up to 15 seconds per sector. To erase an entire chip can take up to 49 seconds depending on the number of sectors. While programming is much faster, on the order of 7 to 300 microseconds per byte, it is still slow compared to other memory devices. Programming an entire chip can still take up to 120 seconds (including the time to verify the data) depending on the capacity of the chip. Typically, standard Dynamic Random Access Memory (“DRAM”) offers write access times on the order of nano-seconds, a difference between flash memory of many orders of magnitude.

This complex nature of programming and erasing flash memory devices leads to a major problem in that they do not provide sufficiently fast write access which then affects read accesses. For example, conventional flash memory devices typically do not allow a processor to perform a read operation while a program or erase operation is underway in the flash memory device. In most implementations, the processor is required to periodically poll a status register of the flash memory device to detect the end of the program or erase operation before initiating a read operation to the flash memory device.

Unfortunately, as noted above, the programming and erase cycle times for typical flash memory devices are orders of magnitude greater than acceptable write access times of a conventional random access main memory using, for example, Dynamic Random Access Memory (“DRAM”). Such long latencies associated with programming or erase operations can lock up the operating system and prevent the system from functioning for unacceptably long time intervals if the flash memory is the only memory in the electronic system. Some prior flash memories allow erase suspend operations in order to address this problem. Erase suspend allows the processor to pause an erase operation so another sector can be read. However, such memories typically still impose a suspend latency interval of several microseconds before a read operation can be initiated. A typical suspend latency interval is from 0.1 to 20 microseconds.

Prior systems may employ multiple flash memory devices in an attempt to prevent such operating system lock up. In such systems, the processor usually has read access to one of the flash memory devices while other flash memory devices are undergoing a program or erase operation. However, such systems typically suffer from high costs because multiple flash memory devices are implemented even though the capacity of a single flash memory device may accommodate the needs of the particular electronic device.

Another prior art system uses a flash memory in combination with an EEPROM memory. This system allows a read operation of one of the memories while writing to the other. However, the size of an EEPROM memory cell is significantly larger than that of a flash memory cell which reduces the amount of storage that can be placed on the memory chip. Further, there are significant design and manufacturing complexities involved with integrating two different memory technologies on the same chip. Therefore, a device which uses an EEPROM in combination with a flash memory will typically be more expensive both to design and manufacture.

In addition, programming and erasing a flash memory involves higher than normal voltages as compared to performing read operations. The use of these higher than normal voltages can cause problems when trying to implement the capability to simultaneously read while programming/erasing. Such problems include difficulties in distributing the high voltages required for the program and erase operations along with normal voltage for read operations and handling increased noise induced on the read sense outputs by the use of high voltages elsewhere within the device. Further, depending on the implementation, redundant logic may also be employed which introduces further complexities.

Accordingly, there is a need for an efficiently designed and manufacturable flash memory device that allows simultaneous read and write operations.

SUMMARY OF THE INVENTION

The present invention is defined by the following claims, and nothing in this section should be taken as a limitation on those claims. By way of introduction, the preferred embodiments described below relate to upper and lower sense amplifiers for a flash memory device capable of simultaneous operation and having an upper memory bank of flash memory cells and a lower memory bank of flash memory cells. Each of the sense amplifiers includes a sense pre-amplifier coupled to an upper or lower memory bank and generating a sense pre-amplifier output signal. Each sense amplifier also includes a sense output amplifier coupled to the sense pre-amplifier and generating a sense output amplifier signal in response to the sense pre-amplifier output signal.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 depicts a block diagram of a flash memory chip according to the present invention that is capable of simultaneous reading and writing;

FIG. 2 depicts one of the upper read sense amplifiers.

FIG. 3 depicts the circuitry of the upper sense pre-amplifier.

FIG. 4 depicts the circuitry of the sense output amplifier.

DETAILED DESCRIPTION OF THE PRESENTLY PREFERRED EMBODIMENTS

Herein, the phrase “coupled with” is defined to mean directly connected to or indirectly connected with through one or more intermediate components. Referring now to the Figures and in particular, FIG. 1, there is schematically shown a flash memory device 100 according to the present invention that provides for reading while simultaneously undergoing a program or erase operation. The memory device 100 according to the present invention may include one or more components of the memory devices disclosed in U.S. Pat. No. 5,867,430 entitled “BANK ARCHITECTURE FOR A NON-VOLATILE MEMORY ENABLING SIMULTANEOUS READING AND WRITING,” to Chen et al and U.S. Pat. No. 5,847,998 entitled “NON-VOLATILE MEMORY ARRAY THAT ENABLES SIMULTANEOUS READ AND WRITE OPERATIONS,” to Van Buskirk, both of which are herein incorporated by reference and further describe the implementation and operation of a device of this type. The memory device 100 may also include one or more components of such exemplary flash memory devices capable of simultaneous read and write operation as the Am29DL162C and Am29DL163C 16 megabit (“Mb”) flash memory chips and the Am29DL322C and Am29DL323C 32 Mb flash memory chips manufactured by Advanced Micro Devices, Inc. located in Sunnyvale, Calif. For more detail regarding these exemplary flash memory chips, refer to “Am29DL322C/Am29L323C 32 Megabit (4M×8-Bit/2M×16-Bit) CMOS 3.0 Volt-only, Simultaneous Operation Flash Memory” Datasheet and “Am29DL162C/Am29L163C 16 Megabit (2M×8Bit/1M×16-Bit) CMOS 3.0 Volt-only, Simultaneous Operation Flash Memory” Datasheet. While the exemplary devices disclosed above have capacities of 16 or 32 Mb, it will be appreciated that the embodiments disclosed herein are equally applicable to devices with higher bit densities such as 64 or 128 Mb devices.

In a typical embedded application of the above exemplary simultaneous operation capable flash memory 100, the available data storage space can be structured to store data and boot code in one bank and control code in another bank. The control code, which can contain command sequences which tell one bank, for example, bank 196, to program/erase data sectors, can reside as executable code in the alternate bank, for example bank 194. While the first bank is being programmed/erased, the system can continue to execute code from the alternate bank to manage other system operations. Similarly, depending on the system implementation, the CPU can also execute code from the first bank while the alternate bank undergoes a program/erase. There is no bank switching latency and no need to suspend the program/erase operation to perform the read. This minimizes the CPU's read/write cycle time, maximizes data throughput and reduces overall system cost by eliminating the need for additional hardware. It will be appreciated that while the exemplary devices have two banks of memory cells, devices with more than two banks are contemplated.

Again referring to FIG. 1, the memory device 100, according to an embodiment of the present invention with a capacity of 32 Mb and operating in word addressable mode, includes a 21 bit address input 102, a 16 bit data input/output (“DATA”) 192, power inputs (not shown in FIG. 1) and control inputs (not shown in FIG. 1). It will be appreciated that the memory device 100 with a capacity of 16 Mb only requires 20 address bits and when operating in byte mode, the 32 Mb device 100 requires 22 address bits and the 16 Mb requires 21 address bits. The control inputs include Chip Enable, Output Enable, and Write Enable. The Chip Enable signal activates the chip's control logic and input/output buffers. When Chip Enable is not asserted, the memory device operates in standby mode. Output Enable is used to gate the outputs of the device through I/O buffers during read cycles. Write Enable is used to enable the write functions of the memory device. In one embodiment, all of the components of FIG. 1 are contained on a single integrated circuit chip. Note that address and control inputs for the exemplary flash memory chips are dependent on memory density and interface implementations. It will be appreciated that the disclosed embodiments can work with higher memory densities and alternate interface implementations with their accompanying alternate address and control input configurations.

The memory device 100 further includes address buffer 104, address multiplexers 106 and 108, address sequencer 110, X logical address decoders 112 and 118, Y logical address decoders 114 and 120, memory array Banks 0 and 1 denoted as 194 and 196 respectively, Dpump 160, data multiplexers 170 and 172, read sense amplifiers 174, verify sense amplifiers 176, negative pump 190, output multiplexer 180, state machine and control logic 122, input/output buffers 182, VPPIG Pump 142, booster 0 denoted as 132, VPXGG Pump 134, booster 1 denoted as 136 and power multiplexers 130, 138, 140 and 144. The address input 102 is received by the address buffer 104, which sends the address to the address multiplexer 106 for bank 194 and the address multiplexer 108 for bank 196. The address sequencer 110 is controlled by the state machine and control logic 122. In one embodiment, the address sequencer 110 is part of the state machine and control logic 122. The output of the address sequencer 110 is an address which is sent to both multiplexer 106 and multiplexer 108. The address sequencer 110 is used to generate sequential addresses during an erase sequence. The output of the multiplexer 106, upper address UA, is communicated to the X address decoder 112 and the Y address decoder 114. The output of the multiplexer 108, lower address LA, is sent to the X address decoder 118 and the Y address decoder 120. The multiplexer 106 chooses between the address from the buffer 104 and the address from the address sequencer 110 in response to a control signal B0_SEL.

The multiplexer 108 chooses between the address from the address buffer 104 and the address from address sequencer 110 based on a control signal B1_SEL. The selection signals B0_SEL and B1_SEL are generated by the state machine and control logic 122.

Bank 194 and bank 196 are arrays (or sets) of flash memory cells (operation of these individual flash memory cells is discussed in more detail below). The banks 194, 196 are organized by words and then by sectors and can either be byte or word addressable. It will be appreciated by those skilled in the art that other types of non-volatile memory are also within the scope of the present invention. The address decode logic for bank 194 includes the X address decoder 112 and the Y address decoder 114. The X address decoder 112 includes a word line decoder and sector decoder. The word line decoder receives address bits UA[6:14] and the sector decoder receives address bits UA[15:20]. The Y address decoder 114 includes a bit line decoder and Y bit line gating. The bit line decoder receives address bits UA[0:5].

The address decode logic for bank 196 includes the X address decoder 118 and the Y address decoder 120. The X address decoder 118 includes a word line decoder and a sector decoder. The word decoder receives address bits LA[6:14] and the sector decoder receives address bits LA[15:20]. The Y address decoder 120 includes a bit line decoder and Y bit line gating. The bit line decoder receives address bits LA[0:5]. In one embodiment, the address buffer 104 includes a latch to store the address being decoded. In another embodiment, the latch can be part of the decoders 112, 114, 118, 120.

FIG. 1 further shows a multiplexer 130 with three inputs: booster zero 132, VPXGG pump 134 and Vcc. The VPXGG pump 134 is a positive power supply for generating and supplying a regulated positive potential to the control gate of selected flash memory cells via the word lines. Many different voltage pumps known in the art are suitable for use in the present invention. A more detailed explanation of one technology which can be included in VPXGG pump 134 can be found in U.S. Pat. No. 5,291,446, “VPP POWER SUPPLY HAVING A REGULATOR CIRCUIT FOR CONTROLLING A REGULATED POSITIVE POTENTIAL” to Van Buskirk et al, the entire contents of which are incorporated herein by reference. Booster 132 is used to boost the word line during reads. The multiplexer 130 receives a selection signal 197 from state machine and control logic 122 and chooses one of its three inputs to send to the word lines of bank 194 via the X address decoder 112. The output of the multiplexer 130 is labeled as VPXG0. FIG. 1 is drawn to show the three inputs 132, 134 and Vcc connected to a multiplexer in order to simplify the disclosure. A more detailed description of one exemplary implementation can be found in U.S. Pat. No. 5,708,387, “FAST 3-STATE BOOSTER CIRCUIT”, to Cleveland et al, the entire contents of which are incorporated herein by reference. Many booster circuits and selection circuits known in the art are suitable for use in the present invention.

FIG. 1 also includes another multiplexer 138 having three inputs: booster one denoted as 136, VPXGG pump 134 and Vcc. Booster 136 is similar to booster 132. The multiplexer 138 operates in a similar fashion to multiplexer 130, and receives its selection signal 198 from the state machine and control logic 122. The output of multiplexer 138 is VPXG1 which is sent to the word lines of bank 196 via the X address decoder 118. The purpose of the multiplexers 130 and 138 is to switch between the three power lines depending on the operation being performed on the particular bank of memory cells.

The VPPIG pump 142 is a high voltage pump used to pass high voltage to the drain of the memory cells. The output of the VPPIG pump 142 is sent to multiplexer 140 and multiplexer 144. Both multiplexers also have Vcc as an input. Multiplexers 140 and 144 switch between inputs based on signals 195 and 199 from the state machine and control logic 122. The output of multiplexer 140 is VPPI0 and the output of multiplexer 144 is VPPI1. During a normal read operation, VPPI1 and VPPI0 are connected to Vcc. VPPI0 is connected to the gate of an N-channel transistor 152. VPPI1 is connected to the gate of an N-channel transistor 154. The source of transistor 152 is connected to Y address decoder 114, multiplexer 170 and multiplexer 172. The drain of transistor 152 is connected to the Dpump 160 and the drain of transistor 154. The Dpump 160 is a drain power supply. Various drain power supplies, known in the art, can be used for the present invention. One exemplary drain pump is disclosed in U.S. Pat. No. 5,263,000, “DRAIN POWER SUPPLY”, to Van Buskirk, et al., the entire contents of which are incorporated herein by reference. The source of transistor 154 is connected to multiplexer 170 and multiplexer 172. The source of transistor 154 is also connected to Y address decoder 120 for purposes of accessing the bit lines in bank 196. The connections to multiplexers 170 and 172 provide a path for reading data from bank 194 and bank 196. Multiplexer 170 uses the signal RSA_SEL from the state machine and control logic 122 to selectively choose one of the two input signals to communicate to the read sense amplifiers 174. Multiplexer 172 uses the selection signal VSA_SEL from the state machine and control logic 122 in order to selectively communicate one of its two input signals to the verify sense amplifiers 176. Thus, the two transistors (152 and 154) and the two multiplexers (170 and 172), are used to selectively pass voltages to the drains of selected cells in bank 194 or bank 196 and to selectively read data from either bank 194 or bank 196. For the sake of clarity, the implementation of multiplexers 170 and 172 is illustrative only. Some of the implementation details are not shown in FIG. 1. In the memory device 100, there are actually two sets of sense amplifiers, one set for each bank 194, 196. There are also two sets of verify sense amplifiers. Data from the banks is multiplexed from each bank 194 or 196 to either its read sense amplifier or its verify sense amplifier. When a bank 194 or 196 is using its read sense amplifier, its verify sense amplifier is turned off and vice versa. It will be appreciated that there are many ways to multiplex multiple data sources among multiple destinations.

Data from either bank 194 or bank 196 can be communicated to either the read sense amplifiers 174 or the verify sense amplifiers 176. Both sense amplifiers are in communication with the state machine and control logic 122. While data from bank 194 is communicated to the read sense amplifiers 174, data from bank 196 can be communicated to the verify sense amplifiers 176. While data from bank 194 is communicated to the verify sense amplifiers 176, data from bank 196 can be communicated to the read sense amplifiers 174. The output of the verify sense amplifiers 176 is sent to the state machine and control logic 122, which is used to verify that a particular byte has been programmed or erased. Note that in the memory device 100, the preferred implementation of the read sense amplifiers 174 provides two sets of sense amplifiers, one for each bank 194, 196. Only the sense amplifiers for the bank 194 or 196 undergoing a read operation are active during the read operation. The verify sense amplifiers 176 of the memory device 100 also have two sets of verify amplifiers, one for each bank.

Data from the read sense amplifiers 174 is sent to multiplexer 180. A second input of the multiplexer 180 includes device 100 status information from the state machine and control logic 122 such as whether or not a program or erase is in progress. The selection signal for multiplexer 180 is provided by the state machine and control logic 122.

I/O buffers 182 are used to pass data out and receive data into memory device 100. While a read is being performed on one of the banks, multiplexer 180 will communicate output data from read sense amplifiers 174 to I/O buffers 182. During an erase or program sequence, multiplexer 180 will communicate status information to I/O buffers 182 so that an outside processor can poll the memory device 100 for the status in regard to the erase or program.

The memory device 100 also includes a negative pump 190 that is used to generate a relatively high negative voltage to the control gates of selected memory cells via the word lines of either bank 194 or bank 196, as selected by the state machine and control logic 122. The negative pump 190 is in communication with the X address decoders 112 and 118. One example of a negative pump can be found in U.S. Pat. No. 5,612,921, “LOW SUPPLY VOLTAGE NEGATIVE CHARGE PUMP”, to Chang et al, the entire contents of which are incorporated herein by reference.

The state machine and control logic 122 provides the control for read, program and erase operations. Many of the selection lines used to select between bank 194 and bank 196 are controlled by the state machine and control logic 122. Alternatively, the output from the X and Y address decoders 112, 114, 118, 120 can be used to select between banks of memory cells.

The memory device 100 is programmed using an embedded programming sequence and is erased using an embedded erase sequence. The embedded sequences allow a processor to initiate a program or erase sequence and perform other tasks while the program and erase sequences are being carried out. The embedded program and erase sequences are controlled by the state machine and control logic 122, which uses a command register to manage the commencement of either sequence. The erase and programming operations are only accessed via the command register which controls an internal state machine that manages device operations. Commands are written to the command register via the data inputs 192 to the memory device 100.

While one bank is being programmed, the other bank can be accessed for a read operation. For example, during a program of a byte in bank 196, the state machine and control logic 122 would cause multiplexer 108 to select the address from buffer 104 for communication to decoders 118 and 120. Further, the state machine and control logic 122 would store the data byte to be programmed from the I/O buffers 182 for verification when the programming completes. The output of bank 196 would be sent to the verify sense amplifiers 176 via multiplexer 172 for comparison with the stored input data. During a simultaneously initiated read operation to bank 194, the state machine and control logic 122, after storing away the data to be programmed, instructs multiplexer 106 to select the address from the buffer 104 for communication to the X and Y address decoders 112 and 114. The output of bank 194 would be sent to the read sense amplifiers 174 via multiplexer 170. The output of the read sense amplifiers 174 would be sent, via multiplexer 180, to the I/O buffers 182 and then to the data bus 192.

Similarly, during an erase of a sector in bank 194, the state machine and control logic 122 would cause multiplexer 106 to select the addresses from the address sequencer 110. The address sequencer 110 would be used to cycle through all the bytes in a particular sector to make sure that each byte is preprogrammed. The sector is subsequently bulk erased. After erasure, the address sequencer 110 would be used to generate addresses to verify each byte of this erased sector. While bank 194 is being erased and multiplexer 106 is selecting an address from the address sequencer 110, a read operation can be carried out in bank 196 by using multiplexer 108 to select the address from the buffer 104 rather than an address from address sequencer 110. During the verify operation of the erase method for bank 194, the state machine and control logic 122 would be verifying the data using the verify sense amplifiers 176, while read data from bank 196 would be communicated to the read sense amplifiers 174. Thus, each bank has two input address paths and two output data paths that can be multiplexed so that either bank can be read from while the other bank is simultaneously being written to.

In the memory device 100, each memory cell, within the banks 194 or 196, includes a nor-type floating gate transistor. It will be appreciated by those skilled in the art, however, that there are many ways to implement a flash memory cell and that the configurations and operating characteristics may vary. It will further be appreciated that the embodiments disclosed herein are generally applicable and not limited to one particular implementation of a flash memory cell. The exemplary transistor has three connections called the source, drain and control gate. In a typical flash memory array, the control gates of the memory cells are connected to the word lines of the array which are used to address the data stored in the array. The sources are selectively connected to ground (for a read operation) depending on which bits are to be read. The drains are connected to the bit lines which are used to sense/read the stored data out of the array.

During an erase operation, the source input of the memory cell transistor is connected to a high positive voltage, the drain/bit line is left to float and the control gate/word line is connected to a relatively high negative voltage supplied by the negative pump 190. An exemplary high positive voltage applied to the source during an erase is approximately 5 volts and an exemplary high negative voltage applied to the control gate/word line by the negative pump 190 is approximately minus 9 volts although other voltages and input combinations can be used. Based on this input configuration, any charge stored on the floating gate of the memory cell transistor will discharge by flowing out to the source due to Fowler-Nordheim Tunneling.

During a program operation, the source input of the memory cell transistor is connected to ground, the drain/bit line is connected to a high positive voltage provided by the VPPIG Dpump drain power supply 142 and the control gate/word line is connected to a high voltage provided by the VPXGG pump positive power supply 134. An exemplary high voltage applied to the drain by the VPPIG 142 is approximately 5 Volts while an exemplary high voltage applied to the control gate by the VPXGG 134 pump is approximately 9 Volts. It will be appreciated by those skilled in the art that other voltage and input combinations can also be used. Based on this input configuration, charge will flow by hot electron transfer to the floating gate of the memory cell transistor and accumulate there.

While programming and erasing the memory cell require higher than normal voltages, reading from the cell only requires the availability of the normal supply voltage. To read from the memory cell, the source is connected to ground (also referred to as Vss) and the control gate/word line are connected to the booster power supplies 132, 136. Prior to selecting the transistors for a read, the bit lines are charged up via the Dpump 160. When the cells turn on (if erased), they will connect their respective bit line to ground, grounding out the bit line. The current value of the memory cell is then sensed from the drain/bit line connection. There is a booster power supply 132 for bank 194 and a booster power supply 136 for bank 196. The booster power supplies 132, 136 are used to boost the word lines of bank 194 or bank 196 during a read operation. An exemplary Vcc supply voltage is 3.0 Volts although other supply voltages are known in the art. An exemplary booster voltage is 5.0 Volts, although the use of the other voltages on the control gate for read operations is possible. If there is charge stored on the floating gate, i.e. the memory cell has been programmed, the flow of current from the drain to the source (ground) will be inhibited and the memory cell will read as a logical “0”. If the memory cell has been erased, there will be no charge stored on the floating gate and with a voltage applied to the control gate greater than the threshold voltage of the transistor, current will flow from the drain to the source and the memory cell will read as a logical “1”. Note that a transistor that is on, grounds its respective bit line. Data read out of the array is considered in its complimentary form, therefore the grounded bit lines are interpreted as logical 1's and the non-grounded bit lines are considered logical 0's.

Application of the particular voltages necessary for each operation is handled by the state machine and control logic 122. This logic 122 controls the multiplexers 130, 138, 140, 144 that place the proper voltages from the various power supplies 132, 134, 136, 142 and Vcc on the memory cell inputs depending on the desired function.

While the total capacity of the simultaneous read and write capable flash memory device 100 is 16 or 32 MB or higher, how that capacity is distributed among the available banks is variable. Users of simultaneous read and write capable flash memory may need different bank partition sizes depending on their applications. To meet the varying needs of users, the flash memory device 100 preferably implements a sliding bank architecture. This architecture allows the simplified design and manufacture of simultaneous flash memory devices with varied bank partition sizes. To alter the bank sizes, only a single metal layer of the chip needs to be altered. For a more detailed discussion of the sliding bank architecture, refer to co-pending and commonly assigned U.S. patent application Ser. No. 09/159,142, “SIMULTANEOUS OPERATION FLASH MEMORY DEVICE WITH A FLEXIBLE BANK PARTITION ARCHITECTURE”, filed Sep. 23, 1998, U.S. patent application Ser. No. 09/159,029, “METHOD OF MAKING FLEXIBLY PARTITIONED METAL LINE SEGMENTS FOR A SIMULTANEOUS OPERATION FLASH MEMORY WITH A FLEXIBLE BANK PARTITION ARCHITECTURE”, filed Sep. 23, 1998 and U.S. patent application Ser. No. 09/159,489, “BANK SELECTOR CIRCUIT FOR A SIMULTANEOUS OPERATION FLASH MEMORY DEVICE WITH A FLEXIBLE BANK PARTITION ARCHITECTURE”, filed Sep. 23, 1998, the entire contents of each of which are incorporated herein by reference. The sliding bank architecture enables the memory device 100 to be produced in many different configurations with only a single mask change in one of the final steps of production. In the case where the flash memory device 100 has a capacity of 32 megabits (Mb), partitions where bank 194 has a capacity 4 or 8 Mb and bank 196 has a capacity of 28 or 24 Mb respectively, can be used. In the case where the flash memory device 100 has a capacity of 16 Mb, partitions where bank 194 has a capacity of 2 or 4 Mb and bank 196 has a capacity of 14 or 12 Mb respectively, can be used. This has the advantages that many different configurations of the flash memory device 100 can share much of the same basic design, process and manufacturing expense.

Turning to the present invention and FIG. 1, the flash memory device 100 includes an array of memory cells or banks 194, 196 for storing bits of data. Two sets of read sense amplifiers 174, one upper 174U and one lower 174L, read the contents of these upper memory cells 194 and lower memory cells 196. The memory device 100 employs a multitude of sense amplifiers 174, so that one read sense amplifier 174 is provided for each of the memory device's data input/output pins 192. The flash memory device 100 may have both byte (eight bits) and word (sixteen bits) versions available where sixteen read sense amplifiers 174 (eight upper and eight lower) and thirty-two read sense amplifiers 174 (sixteen upper and 16 lower), respectively, are required.

FIG. 2 shows a block diagram of a member of the upper set of read sense amplifiers 174U. The lower set of read sense amplifiers 174L is similar. The upper read sense amplifier 174 U depicted is labeled USA_UNIT. Each of the upper and lower read sense amplifiers has two stages of amplification, a sense pre-amplifier 274U and a sense output amplifier 295U. Thus, USA_UNIT comprises upper sense pre-amplifier 274 U labeled Ucascodr and upper sense output amplifier 295U labeled Usamp. The data output 296U, or sense amplifier output sign from the upper sense output amplifier 295U labeled UDSIBn is communicated to multiplexer 180. The inputs of the upper sense output amplifier 295U derive, at least in part, from the upper sense pre-amplifier 274U and are labeled USAin 279U. It will be appreciated that the upper sense output amplifier 295U may have additional inputs not derived from the upper sense pre-amplifier 274U.

FIG. 2 also depicts several input signals for the upper sense pre-amplifiers 274U. The data bit line signals 275U, labeled UDATABn, communicate bit line information to the upper sense pre-amplifiers 274U from the upper bank 194 via the multiplexers 170.

FIG. 2 also depicts reset input signals labeled URSTR 276U which reset the upper data bit lines to ground before the onset of the amplification function of the sense pre-amplifiers 274U and which derive from portions of the circuitry not shown. The power down input signals labeled UqBPDSA 277U turn off the power to the upper sense pre-amplifier 274U when not in use.

The internal circuit structure of the upper sense pre-amplifier 274U labeled Ucascodr is shown in FIG. 3. The operation of the internal circuit structure of the upper sense pre-amplifier 274U is discussed in detail in the co-pending and commonly assigned U.S. Patent Application which is herein incorporated by reference: Ser. No. 09/421/985, “LOW VOLTAGE READ CASCODE FOR 2V/3V AND DIFFERENT BANK COMBINATIONS WITHOUT METAL OPTIONS FOR A SIMULTANEOUS OPERATION FLASH MEMORY DEVICE, filed Oct. 19, 1999. All the n-mos transistors are mid-voltage transistors with a gate oxide thickness that can tolerate up to about 12 volts applied between gate and source without damage or significant leakage; the n-mos transistors are marked “mv” to describe this characteristic. The two p-mos transistors 318 and 319 are not so marked because their gate oxide thickness can support only a low voltage of up to about 3.5 volts. Transistors 311, 314 and 316 are marked with a “Z” to indicate that they have approximately a zero gate threshold voltage V_(t).

Transistors marked with a “o” in the oxide but without a “Z” mark, 313 and 315, are called intrinsic transistors and have a gate threshold voltage V_(t) of about 0.4 volts. In several embodiments voltage V_(cc) will range between about 2 volts (“low voltage”) and about 3 volts (“high voltage”), a considerable range.

FIG. 4 depicts one of the upper or lower sense output amplifiers 295U. The sense output amplifier 295U operates in a differential mode comparing reference signal USAref 278U from a reference sense amplifier circuit USArefr 211 and signal from the upper sense pre-amplifier 274U. Some of the Z-type transistors function as a current source. These Z-type transistors facilitate circuit operation with a low voltage V_(cc), e.g., 2-3 volts, to enable desired transistor turn-on.

Transistor 610 serves the advantage of conserving power by shutting off the current source when the power-down signal UqBPDSA 277U is asserted.

One reference sense amplifier circuit USArefr 211 must provide the USAref reference signal to 16 USAMPs, a large fan-out. That fan-out creates such large capacitive loading on USArefr 278U that the input transistors 414 and 415 receiving signal USAref as well as USAin 279U must be nearly as small in area as the fabrication technology allows, i.e., both W and L in the W/L ratio should separately be small to reduce transistor gate capacitance. To account for the same capacitive concerns, capacitor 430 is connected between USAin 279U and ground to match the capacitive loading between USAin 279U and USAref 278U. For enhanced speed in general, the channel length of the transistors should be made as small as fabrication technology allows.

To achieve good differential operation, certain pairs of transistors are matched to other pairs. The transistors pair 414, 416 is matched to the pair 415, 417. In addition, transistors 412 and 411 are matched to each other.

The last stage of the amplifier includes an equalization circuit turned on by signal UEQ which equalizes the input and output stages of the sense output amplifier. The latch stage 440 of the amplifier operates with an approximately middle V_(cc) trip point. The trip point of the EQ stage is set at about ½ V_(cc) to realize fast operation time. The latch stage 440 turn-off of the inputs USAref 278U and USAin 279U and all the stages of the sense output amplifier 295U prior to the latch 440 as soon as the latch stage 440 latches the data. Significant power savings result from this latching feature.

For the best operation of the output sense amplifier 295U, parasitic loads at nodes CSO, DOO and LOO should be minimized, e.g., by use of metal, if possible.

It is to be noted that suitable transistor sizes specifying channel width to length ratios (measured in micrometers or microns) for the transistors which make up the depicted circuits have been omitted from the figures. It will be appreciated that suitable ratios may be chosen depending on the design requirements and the capabilities and limitations of the particular integrated circuit fabrication process used for implementation of the circuit as well as the performance requirements of the specific embodiment.

It is therefore intended that the foregoing detailed description be regarded as illustrative rather than limiting, and that it be understood that it is the following claims, including all equivalents, that are intended to define the spirit and scope of this invention. 

What is claimed is:
 1. A memory device comprising a bank of memory cells; a sense pre-amplifier coupled to the memory bank, said sense pre-amplifier generating a sense pre-amplifier output signal; a sense output amplifier coupled to said sense pre-amplifier, said sense output amplifier generating a sense amplifier output signal in response to a differential comparison between said sense pre-amplifier output signal and a reference signal; further comprising a reference sense amplifier circuit coupled to said sense output amplifier thereby generating said reference signal; further comprising a plurality of said sense output amplifiers and one of said reference sense amplifier circuit, wherein said reference sense amplifier circuit is coupled to each of said plurality of said sense output amplifiers with a plurality of circuit lines thereby directing said reference signal to each of said plurality of said sense output amplifiers; and wherein each of said sense output amplifiers comprise an input transistor coupled to said reference signal and another input transistor coupled to said sense pre-amplifier output signal, said input transistors having a small channel width and channel length thereby reducing transistor gate capacitance.
 2. A memory device comprising a bank of memory cells; a sense pre-amplifier coupled to the memory bank, said sense pre-amplifier generating a sense pre-amplifier output signal; a sense output amplifier coupled to said sense pre-amplifier, said sense output amplifier generating a sense amplifier output signal in response to a differential comparison between said sense pre-amplifier output signal and a reference signal; further comprising a reference sense amplifier circuit coupled to said sense output amplifier thereby generating said reference signal; further comprising a plurality of said sense output amplifiers and one of said reference sense amplifier circuit, wherein said reference sense amplifier circuit is coupled to each of said plurality of said sense output amplifiers with a plurality of circuit lines thereby directing said reference signal to each of said plurality of said sense output amplifiers; and wherein said sense pre-amplifier output signal is coupled to a capacitor, said capacitor also being coupled to a ground thereby matching the capacitive loading of said reference signal.
 3. A memory device comprising a bank of memory cells; a sense pre-amplifier coupled to the memory bank, said sense pre-amplifier generating a sense pre-amplifier output signal; a sense output amplifier coupled to said sense pre-amplifier, said sense output amplifier generating a sense amplifier output signal in response to a differential comparison between said sense pre-amplifier output signal and a reference signal; further comprising a reference sense amplifier circuit coupled to said sense output amplifier thereby generating said reference signal; and wherein each of said sense output amplifiers comprise a transistor pair corresponding to said reference signal and another transistor pair corresponding to said sense pre-amplifier output signal, said transistor pairs matching each other thereby enhancing differential operation.
 4. A memory device comprising a bank of memory cells; a sense pre-amplifier coupled to the memory bank, said sense pre-amplifier generating a sense pre-amplifier output signal; a sense output amplifier coupled to said sense pre-amplifier, said sense output amplifier generating a sense amplifier output signal in response to a differential comparison between said sense pre-amplifier output signal and a reference signal; and wherein said sense output amplifier comprises an equalization circuit coupled to an input stage and an output stage thereby equalizing said input stage and said output stage, wherein said equalization circuit has a trip point set at about half V_(cc).
 5. The memory device according to claim 4, wherein said sense output amplifier comprises a latch stage adapted to store said sense amplifier output signal, said latch stage being coupled to said equalization circuit.
 6. The memory device according to claim 4, wherein said sense output amplifier comprises transistors adapted to disconnect said latch stage from said reference signal and said sense pre-amplifier output signal after said sense amplifier output signal is stored in said latch stage thereby allowing said reference signal and said sense pre-amplifier output signal to be turned off and thereby saving power.
 7. A memory device comprising a bank of memory cells; a sense pre-amplifier coupled to the memory bank, said sense pre-amplifier generating a sense pre-amplifier output signal; a sense output amplifier coupled to said sense pre-amplifier, said sense output amplifier generating a sense amplifier output signal in response to a differential comparison between said sense pre-amplifier output signal and a reference signal; wherein said sense pre-amplifier and said sense output amplifier form a two stage sense amplifier, further comprising two of said memory banks and two of said two stage sense amplifiers, wherein one of said two stage sense amplifiers corresponds and is coupled to one of said memory banks and the other two stage sense amplifier corresponds and is coupled to the other memory bank; further comprising a plurality of said two stage sense amplifiers coupled to said corresponding memory banks, said plurality corresponding to a number of data outputs; further comprising a reference sense amplifier circuit coupled to said sense output amplifier thereby generating said reference signal; wherein one of said reference sense amplifier circuit is coupled to each of a plurality of said sense output amplifiers with a plurality of circuit lines thereby directing said reference signal to each of said plurality of said sense output amplifiers; wherein said sense output amplifier comprises an equalization circuit coupled to an input stage and an output stage thereby equalizing said input stage and said output stage; wherein said sense output amplifier comprises a latch stage adapted to store said sense amplifier output signal, said latch stage being coupled to said equalization circuit; further comprising a reset input signal coupled to said sense pre-amplifier, said reset input signal resetting said memory bank to ground before amplification; further comprising a power down input signal coupled to said sense pre-amplifier, said power down input signal turning off said sense pre-amplifier when not in use; wherein said sense output amplifier comprises transistors adapted to disconnect said latch stage from said reference signal and said sense pre-amplifier output signal after said sense amplifier output signal is stored in said latch stage thereby allowing said reference signal and said sense pre-amplifier output signal to be turned off and thereby saving power; wherein each of said sense output amplifiers comprise an input transistor coupled to said reference signal and another input transistor coupled to said sense pre-amplifier output signal, said input transistors having a small channel width and channel length thereby reducing transistor gate capacitance; wherein said sense pre-amplifier output signal is coupled to a capacitor, said capacitor also being coupled to a ground thereby matching the capacitive loading of said reference signal; and wherein each of said sense output amplifiers comprise a transistor pair corresponding to said reference signal and another transistor pair corresponding to said sense pre-amplifier output signal, said transistor pairs matching each other thereby enhancing differential operation. 